® 




Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(jj) Publication number: 



0117058 

B1 



® 



EUROPEAN PATENT SPECIFICATION 



® Date of publication of patent specification: 20.09.89 
® Application number: 84300298.1 
® Date of filing: 18.01.84 



® int. ci. 4 : C 12 N 15/00, C 12 P 21/02, 
C 12 N 5/00 //C12R1/91 



(&) Methods for producing mature protein in vertebrate host cells. 



® Priority: 19.01.83 US 459152 

p 

(§) Date of publication of application: 
29.08.84 Bulletin 84/35 

(§) Publication of the grant of the patent: 
20.09.89 Bulletin 89/38 - 

@ Designated Contracting States: 
AT BE CH DE FR GB IT U LU NL SE 



00 

o 



o 

Q. 



References cited: 
EP-A-0 009 930 
EP-A-0 093 619 
EP-A-0 117 059 
EP-A-0 117 060 
EP-A-0 123 372 
WO-A-81/02425 
WO-A-81/02425 

CHEMICAL ABSTRACTS, vol. 96, no. 21, 24th 
May 1982, page 174, no. 175313p, Columbus, 
Ohio, US; G. RINGOLD et al.: "Coexpression 
and amplification of dihydrofolate reductase 
cDMA and the Escherichia coli XGPRT gene in 
Chinese hamster ovary cells'' & J. MOL APPL. 
GENET. 1981, 1(3), 165-75 



® Proprietor: GENENTECH, INC. 
460 Point San Bruno Boulevard 
South San Francisco California 94080 (US) 



@ Inventor: Levinson, Arthur David 
3690 Ralston Avenue 
Hillsborough California 94010 (US) 
Inventor: Simonsen, Christian Clinton 
1509 Parkview Avenue 
San Jose California 95310 (US) 



3) Representative: Armitage, Ian Michael et al 
MEWBURN ELLIS & CO. 2/3 Cursitor Street 
London EC4A1BQ (GB) 

(55) References cited: 

DNA, vol. 1, no. 3, 1st July 1982, pages 213-221, 
Mary Ann Liebert, Inc., Publishers; CHUNG- 
CHENG LIU et al.: Direct expression of 
hepatitis B surface antigen in monkey cells 
from an SV40 vector" 

EUKARYOTIC VIRAL VECTORS CONFERENCE^ 
1982, pages 7-12, Yakov Gluzman, Cold Spring 
Harbor Lab., Cold Spring Harbor, N.Y., US; D.H. 
HAMER et al.: "Induction of a mouse 
metallothionein-l gene in animal virus vectors" 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may 
give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall 
be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been 
paid. (Art. 99(1 ) European patent convention). 



Courier Press, Leamington Spa, England. 



EP 0117 058 B1 

(§) References cited: 

PROCEEDINGS OF THE NATIONAL ACADEMY 
OF SCIENCES USA, vol. 77, no. 6, June 1980, 
pages 3567-3570; M. WIGLER et al.: 
Transformation of mammalian cells with an 
amplifiable dominant-acting gene" 
Eukoryotic vital vectors, ed. by Y. Gluzman 
CSHL 1982, p. 41-45, p. 193-98, p. 145-51 
Mol. and Cell Biol. (1981), p. 854-64 



5 



EP 0117 058 B1 



Description 

Background of the invention 

This invention relates to the application of recombinant DNA technology to the production of 
s polypeptide in vertibrate cell cultures. More specifically, this invention relates to utilizing the coding, 
sequence for a secondary control polypeptide as a tool in controlling production of a foreign polypeptide 

by the vertebrate ceil culture. 

The general principle of utilizing a host cell for the production of a heterologous protein— i.e., a protein 
which is ordinarily not produced by this cell— is well known. However, the technical difficulties of obtaining 

10 reasonable quantities of the heterologous protein by employing vertebrate host cells which are desirable 
by virtue of their properties with regard to handling the protein formed are many. There have been a 
number of successful examples of incorporating genetic material coding for heterologous proteins into 
bacteria and obtaining expression thereof. For example, human interferon, desacetyl-thymosin alpha-1, 
somatostatin, and human growth hormone have been thus produced. Recently, it has been possible to 

is utilise non-bacterial hosts such as yeast cells (see, e.g., co-pending application, U.S. Serial No. 237,913, 
filed February 25, 1981; EPO Publication No. 0060057) and vertebrate cell cultures (U.S. Application Serial 
No. 298,235, filed August 31, 1981; EPO Publication No. 0073656) as hosts. The use of vertebrate cell 
cultures as hosts in the production of mammalian proteins is advantageous because such systems have 
additional capabilities for modification, glycosylation, addition of transport sequences, and other 

20 subsequent treatment of the resulting peptide produced in the cell. For example, while bacteria may be 
successfully transfected and caused to express "alpha thymosin", the polypeptide produced lacks the 
N-acetyl group of the "natural" alpha thymosin found in mammalian system. 

In general, the genetic engineering techniques designed to enable host cells to produce heterologous 
proteins include preparation of an "expression vector" which is a DNA sequence containing, 

25 (1 ) a "promoter", i.e., a sequence of nucleotides controlling and permitting the expression of a coding 
sequence; 

(2) a sequence providing mRNA with a ribosome binding site; 

(3) a "coding region" i.e., a sequence of nucleotides which codes for the desired polypeptide; and 

(4) a "termination sequence" which permits transcription to be terminated when the entire code for the 
30 desired protein has been read; and 

(5) if the vector is not directly inserted into the genome, a "replicon" or origin of replication which 
permits the entire vector to be reproduced once it is within the cell. 

In the construction of vectors for the method of the present invention, the same promoter controls two 
coding sequences, one for a desired protein, and the other for a secondary protein. Transcription 

35 termination is also shared by these sequences. However, the proteins are produced in discrete form 
because they are separated by a stop and start translational signal. 

Ordinarily, the genetic expression vectors are in the form of plasmids, which are extrachromosomal 
loops of double stranded DNA. These are found in natural form in baceteria, often in multiple copies per 
cell. However, artificial plasmids can also be constructed, (and these, of course, are the most useful), by 

40 splicing together the four essential elements outlined above in proper sequence using appropriate 
"restriction enzymes". Restriction enzymes are nucleases whose catalytic activity is limited to lysing at a 
particular base sequence, each base sequence being characteristic for a particular restriction enzyme. By 
artful construction of the terminal ends of the elements outlined above (or fractions thereof) restriction 
enzymes may be found to splice these elements together to form a finished genetic expression vector. 

45 It then remains to induce the host cell to incorporate the vector (transfection), and to grow the host 
cells in such a way as to effect the synthesis of the polypeptide desired as a concomitant of normal growth. 

Two typical problems are associated with the above-outlined procedure. First, it is desirable to have in 
the vector, in addition to the four essential elements outlined above, a marker which will permit a 
straightforward selection for those cells which have, in fact, accepted the genetic expression vector. In 

so using bacterial cells as hosts, frequently used markers are resistance to an antibiotic such as tetracycline or 
ampicillin. Only those cells which are drug resistant will grow in cultures containing the antibiotic. 
Therefore, if the cell culture which has been sought to be transfected is grown on a medium containing the 
antibiotic, only the cells actually transfected will appear as colonies. As the frequency of transformation is 
quite low (approximately 1 cell in 10 6 being transfected under ideal conditions), this is almost an essential 

55 prerequisite as a practical matter. 

For vertebrate cells as hosts, the transformation rate achieved is more efficient (about 1 cell in 10 ). 
However, facile selection remains important in obtaining the desired transfected cells. Selection is 
rendered important, also, because the rate of cell division is about fifty times lower than in baceterial 
cells — |.e. r although £ colt divide once in about every 20—30 minutes, human tissue culture cells divide 

eo- only once in every 12 to 24 hours. 

The present invention, in one aspect, addresses the problem of selecting for vertibrate cells which have 
taken up the genetic expression vector for the desired protein by utilizing expression of the coding 
sequence for a secondary protein, such, for example, as an essential enzyme in which the host cell is 
deficient. For example, dihydrofolate reductase (DHFR) may be used as a marker using host cells deficient 

65 in DHFR. 
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A second problem attendant on production of polypeptides in a foregin host is recovery of satisfactory 
quantities of protein. It would be desirable to have some mechanism to regulate, and preferably enhance, 
the production of the desired heterologous polypeptide, in a second aspect of the invention, a secondary 
coding sequence which can be affected by externally controlled parameters is utilized to allow control of 
5 expression by control of these parameters. Furthermore, provision of both sequences on a polycistrom in 
itself permits selection of transformants with high expression levels of the primary sequence. 

It has been shown that DHFR coding sequences can be introduced into, expressed in, and amplified in 
mammalian cells. Genomic DNAfrom methotrexate resistant Chinese Hamster Ovary (CHO) cells has been 
introduced into mouse cells and results in transformants which are also resistant to methotrexate (1). The 

10 mechanism by which methotrexate (MTX) resistance in mouse cells is developed appears to be threefold: 
through gene amplification of the DHFR coding sequence (2, 3, 4); through decrease in uptake of MTX (5, 6) 
and through reduction in affinity of the DHFR produced for MTX (7). 

It appears that amplification of the DHFR gene through MTX exposure can result in a concommitant 
amplification of a cotransfected gene sequence. It has also been shown that mouse fibroblasts, transfected 

w with both a plasmid containing hepatitis B DNA sequences, and genomic DNA from a hamster cell line 
containing a mutant gene for MTX-resistant DHFR, secrete increased amounts of hepatitis B surface 
antigen (HBsAg) into the medium when MTX is employed to stimulate DHFR sequence amplification (8). 
Further, mRNA coding for the £ coli protein XGPRT is amplified in the presence of MTX in CHO cells 
co-transfected with the DHFR and XGPRT gene sequences under control by independent promoters (9). 

20 Finally, increased expression of a sequence endogenous to the promoter in a DHFR/SV40 pfasmid 
combination in the presence of MTX has been demonstrated (10). 

It is known that viruses which can infect vertebrate cells often have more than one coding sequence 
under the control of a single promoter. However, these coding sequences are not expressed 
simultaneously, but rather they selectively are brought under the control of the promoter for transcription 

25 by splicing of the mRNA, for which purpose specific splice sites (donor and acceptor regions) are 
recognisable in the sequence. Examples of this can be seen in Eukaryotic Viral Vectors, ed. Yakov Gluzman, 
CSHL, 1982, pp 145—151 and pp 193—198. 

However, there is no effective intervening splice site, so that translation of the second (downstream) 
coding sequence would not be expected. 

30 It is also known that in vertebrate cells sometimes translation is not initiated at the first AUG codon in 
mRNA, but rather from an AUG somewhat downstream, which may be internal to the open reading frame 
(see for example Subramain et al, Mol. and Cell. Biol., vol. 2, pp 854—864). 

The present invention is based on the discovery that, in vertebrate cell hosts, where the genetic 
expression vector for a desired polypeptide contains a secondary genetic coding sequence under the 

35 control of the same promoter but downstream of and separated from the primary coding sequence by 
translation stop and start codons and without any effective intervening splice site, nevertheless some 
transfectants may express both sequences, albeit the second more weakly than the first. This secondary 
sequence can therefore provide for a convenient screening marker, both for transformants in general, and 
for transformants showing high expression levels for the primary sequence, as well as serving as a control 

<o device whereby the expression of a desired polypeptide can be regulated, most frequently enhanced. 
This is particularly significant as the two proteins, according to the method of this, invention, are 
produced separately in mature form. While both DNA coding sequences are controlled by the same 
transcriptional promoter, so that a fused message (mRNA) is formed, they are separated by a translational 
stop signal for the first and start signal for the second, so that two independent proteins result. 

45 As a vertebrate host cell culture system is often advantageous because it is capable of glycosylation, 
phosphorylation, and lipid association appropriate to animal systems (whereas bacterial hosts are not), it is 
significant that marker systems and regulating systems can be provided within this context. 

The present invention concerns a method of selecting transfected vertebrate host cells for expression 
of a desired polypeptide through the use of a polycistronic expression vector which contains sequences 

so coding for a secondary protein and a desired protein, wherein both the desired and secondary sequences 
are governed by the same promoter. The coding sequences are separated by translational stop and start 
signal codons. The expression of the secondary sequence effects control over the expression of the 
sequence for the desired protein, and the secondary protein functions as a marker for selection of 
transfected cells. The invention includes use of secondary sequences having either or both of these effects. 

55 Brief description of the drawings 

Figure 1 shows the construction of an expression vector for HBsAg, pE342.HS94.HBV. 
Figure 2 shows the construction of an expression vector for DHFR, pE342.D22. 
Figure 3 shows the construction of expression vectors for DHFR and HBsAg, pE342.HBV.D22 and 
pE342.HBV.E400.D22. 

60 

Detailed description and description of the preferred embodiment 
A. Definitions 

As used herein 

"Plasmids" includes both naturally occuring plasmids in bacteria, and artificially constructed circular 
6s DNA fragments. 
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"Expression vector" means a plasmid which contains at least the four essential elements set forth 
hereinabove for the expression of the heterologous peptide in a host cell culture. 

"Heterologous protein" means a protein or peptide which is not normally produced by, or required for 

the viability of, the host organism. m 
* "Desired protein" means a heterologous protein or peptide which the method of the invention is 

designed to produce. ... ^ 

"Secondary peptide" means the protein or peptide which is not the heterologous peptide desired as 
the primary product of the expression in the host cell, but rather a different heterologous peptide which, by 
virtue of its own characteristics, or by virtue of the characteristics of the sequence coding for it is capable of 
10 "marking" transfection by the expression vector and/or regulating the expression of the primarily desired 
heterologous peptide. 

The peptide sequence may be either long or short ranging from about 5 amino acids to about 1000 
amino acids. The conventional distinction between the words peptide and protein is not routinely observed 
in the description of the invention. If the distinction is to be made, it will be so specified. 

is "Primary sequence" is the nucleotide sequence coding for the desired peptide, and 

"Secondary sequence" means a sequence of nucleotides which codes for the secondary peptide. 
"Transfection" of a host cell means that an expression vector has been taken up by the host cell in a 
detectable manner whether or not any coding sequences are in fact expressed. In the context of the present 
invention, successful transfection will be recognized when any indication of the operation of this vector 

20 within the host cell is realized. It is recognized that there are various levels of success within its context. 
First, the vector's coding sequence may or may not be expressed. If the vector is properly constructed with 
inclusion of promoter and terminator, however, it is highly probable that expression will occur. Second, if 
the plasmid representing the vector is taken up by the cell and expressed, but fails to be incorporated 
within the normal chromasomal material of the cell, the ability to express this plasmid will be lost after a 

2S few generations. On the other hand, if the vector is taken up within the chromosome, the expression 
remains stable through repeated replications of the host cell. There may also be an intermediate result. The 
precise details of the manner in which transfection can thus occur are not understood, but it is clear that a 
continuum of outcomes is found experimentally in terms of the stability of the expression over several 
generations of the host culture. 

30 

B. A preferred embodiment of the desired peptide 

In a preferred specific embodiment, exemplary of the invention herein, the primary genetic sequence 
encodes the hepatitis B-surface antigen (HBsAg). This protein is derived from hepatitis B virus, the infective 
agent of hepatitis B in human beings. This disease is characterized by debilitation, liver damage, primary 

35 carcinoma, and often death. The disease is reasonably widespread especially in many African and Asian 
countries, where many people are chronic carriers with the potential of transmitting the disease 
pandemically. The virus (HBV) consists of a DNA molecule surrounded by a nuclear capsid, in turn 
surrounded by an envelope. Proteins which are associated with the virus include the surface antigen 
(HBsAg), a core antigen, and a DNA polymerase. The HBsAg is known to produce antibodies in infected 

40 people. HBsAg found in the serum of infected individuals consists of protein particles which average ca. 22 
nanometers in diameter, and are thus called "22 nanometer particles". Accordingly, it is believed that the 
HBsAg particle would be an effective basis for a vaccine. , 

C. A preferred embodiment of the secondary peptide 

45 It has been recognized that environmental conditions are often effective in controlling the quantity of 
particular enzymes that are produced by cells under certain growth conditions. In the preferred 
embodiment of the present invention, advantage is taken of the sensitivity of certain cells to methotrexate 
(MTX) which is an inhibitor of dihydrofolate reductase (DHFR). DHFR is an enzyme which is required, 
indirectly, in synthesis reactions involving the transfer of one carbon units. Lack of DHFR activity results in 

so inability of cells to grow except in the presence of those compounds which otherwise require transfer of 
one carbon units for their synthesis. Cells lacking DHFR, however, will grow in the presence of a 
combination of glycine, thymidine and hypoxanthine. 

Cells which normally produce DHFR are known to be inhibited by methotrexate. Most of the time, 
addition of appropriate amounts of methotrexate to normal cells will result in the death of the cells. 

55 However, certain cells appear to survive the methotrexate treatment by making increased amounts of 
DHFR, thus exceeding the capacity of the methotrexate to inhibit this enzyme (2, 3, 4). It has been shown 
previously that in such cells, there is an increased amount of messenger RNA coding for the DHFR 
sequence. This is explained by assuming an increase in the amount of DNA in the genetic material coding 
for this messenger RNA. In effect, apparently the addition of methotrexate causes gene amplification of the 

so DHFR gene. Genetic sequences which are physically connected with the DHFR sequence although not 
regulated by the same promoter are also amplified (1, 8, 9, 10). Consequently, it is possible to use the 
amplification of the DHFR gene resulting from methotrexate treatment to amplify concomitantly the gene 
for another protein, in this case, the desired peptide. 

Moreover, if the host cells into which the secondary sequence for DHFR is introduced are themselves 

ss DHFR deficient, DHFR also serves as a convenient marker for selection of cells successfully transfected. If 
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the DHFR sequence is effectively connected to the sequence for the desired peptide, this ability serves as a 
marker for successful transfection with the desired sequence as well. 

D. Vector construction techniques employed {materials and methods) 

5 The vectors constructed in the Examples set forth in E are constructed by cleavage and ligation of 
isolated plasmids or DNA fragments. 

Cleavage is performed by treating with restriction enzyme (or enzymes) in suitable buffer. In general, 
about 20 |ig plasmid or DNA fragments require about 1 — 5 units of enzyme in 200 \x\ of buffer solution. 
(Appropriate buffers for particular restriction enzymes are specified by the manufacturer). Incubation times 
jo of about 1 hour at 37°C are workable. After incubations, protein is removed by extraction with phenol and 
chloroform, and the nucleic acid recovered from the aqueous fraction by precipitation with ethanol. 

If blunt ends are required, the preparation is treated for 15 minutes at 15° with 10 units of Polymerase I 
(Klenow), phenol-chloroform extracted, and ethanol precipitated. 

Size separation of the cleaved fragments is performed using 6 percent polyacrylamide gel described by 
15 Goeddel, D. et al., Nucleic Acids Res 8; 4057 (1980) incorporated herein by reference. 

For ligating approximately equimolar amounts of the desired components, suitably end tailored to 
provide correct matching are treated with about 10 units T4 DNA ligase per 0.5 pg DNA. 

E. Detailed description of a preferred embodiment: 

20 In general, the expression vector suitable for the present invention is constructed by adaptation of 
gene splicing techniques. The starting material is a naturally occuring bacterial plasmid, previously 
modified, if desired. A preferred embodiment of the present invention utilizes a pML plasmid which is a 
modified pBR 322 plasmid prepared according to Lusky, M. eta/.. Nature 239:73 (1981) which is provided 
with a single promoter, derived from the simian virus SV-40 and the coding sequence for DHFR and for 

25 HBsAg. 

In the construction, the promoter (as well as a ribosome binding sequence) is placed upstream from 
the coding sequence coding for a desired protein and one coding for a secondary protein. A single 
transcription termination sequence is downstream from both. At the end of the upstream code sequence is 
placed a translational stop signal; a transiational start signal begins the downstream sequence. Thus, 
30 expression of the two coding sequences results in a single mRNA strand, but two separate mature proteins. 

In a particularly preferred embodiment, the sequence coding for the secondary peptide is downstream 
from that coding for the desired peptide. Under these circumstances, procedures designed to select for the 
cells transformed by the secondary peptide will also select for particularly enhanced production of the 
desired peptide. 

35 

F. Examples 

The following examples are intended to illustrate, but not limit the invention- 
Example 1 

40 Vector containing the HBsAg sequence, pE342.HS94.HBV 

Figure 1 shows the construction of the HBsAg plasmid. 

The 1986 bp EcoRI-Bglll fragment which spans the surface antigen gene was isolated from the HBV 
viral genome cloned with pBR322 as described by Liu era/., DNA 7:213 (1982), incorporated herein by 
reference. This sequence was ligated between the EcoRI and BamHl sites of pML, a pBR322 derivative 

45 which lacks sequences inhibitory to its replication in simian cells, as described by Lusky et al, Nature 
293:79 (1981), incorporated herein by reference. Into the single EcoRI site of the resulting plasmid was 
inserted the 342 bp origin fragment of SV40 obtained by Hindlll Pvull digestion of the virus genome, which 
had been modified to be bounded by EcoRI restriction sites resulting in p342E (also referred to as 
pHBs348-E) as described by Levinson et a/., patent application Serial No. 326,980, filed December 3, 1981, 

so which is hereby incorporated by reference (EPO Publication No. 0073656). (Briefly, the origin of the Simian 
virus SV40 was isolated by digesting SV40 DNA with Hindlll, and converting the Hindlll ends to EcoRI ends 
by the addition of a converter (AGCTGAATTC). This DNA was cut with Pvull, and Rl linkers added. 
Following digestion with EcoRI, the 348 base-pair fragment spanning the origin was isolated by 
polyacrylamide gel electrophoresis and electroelution, and cloned in pBR322. Expression plasmid 

55 pHBs348-E was constructed by cloning the 1986 base-pair fragment resulting from EcoRI and Bglll 
digestion of HBV [Animal Virus Genetics, (CH. 5) Acad. Rress, N. Y. (1980) (which spans the gene encoding 
HBsAg) into the plasmid pML (Lusky et al., Nature 293:79, 1981) at the EcoRI and BamHl sites. (pML is a 
derivative of pBR322 which has deletion eliminating sequences which are inhibitory to plasmid replication 
in monkey cells). The resulting plasmid (pRI-Bgl) was then linearized with EcoRI, and the 348 base-pair 

qq fragment representing the SV40 origin region was introduced into the EcoRI site of pRI-Bgl. The origin 
fragment can insert in either orientation. Since this fragment encodes both the early and late SV40 
promoters in addition to the origin of replication, HBV genes could be expressed under the control of either 
promoter depending on this orientation (pHBS348-E representing HBs expressed under control of the early 
promoter). pE342 is modified by partially digesting with EcoRI, filling in the cleaved site using Klenow DNA 

5 5 polymerase I, and ligating the plasmid back together, thus removing the EcoRI site preceding the SV40 
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origin in pE342. The resulting plasmid, designated pE342AR1, is digested with EcoRI, filled in using Klenow 
DNA polymerase l # and subcut with BamHI. After electrophoresing on acrylamide gel, the approximately 
3500 bp fragment is electroeluted, phenol-chloroform extracted, and ethanol precipitated as above). The 5' 
nontranslated leader region of HBsAg was removed by treatment with EcoRI and with Xba, and the 

5 analogous 150 bp EcoRI-Xba fragment of a hepatitis expression plasmid pHS94 (Liu et al. (supra)) was 
inserted in its place to create pE342.HS94.HBV. 

(As described by Liu, et al. pHS94 contains the translational start codon of the authentic HBsAg gene, 
but lacks all 5' nontranslated message sequences. The levels of expression of both the authentic EcoRI-Bglll 
and pHS94 derived equivalent under control of the SV40 early promoter as described above are equivalent 

10 and are interchangeable without affecting the performance of the plasmid). 

Example 2 

Vector containing the DHFR sequence, pE342.D22 

A plasmid carrying DHFR as the only expressable sequence is pE348,D22, the construction of which is 

15 shown in Figure 2. 

The 1600 bp Pst I insert of the DHFR cDNA plasmid DHFR-11 (Nunberg et al., Cell /S:355, 1980) was 
treated with the exonuclease Bal31 in order to remove the poly G:C region adjacent to the Pst I sites, 
digested with Bglll and the resulting fragments of approximately 660 bp isolated from gels. The Bal31-Bgll1 
digested cDNA was ligated into a pBR322 plasmid derivative containing a Bglll site. (Following digestion of 

20 pBR322 with Hind III, the plasmid fragment was filled in using Klenow DNA polymerase in the presence of 
the four deoxynucleotide triphosphates, and subcut with Bglll). The resulting plasmid, pDHFR-D22, has an 
EcoRI site situated 29 bp upstream of the fusion site between pBR322 and the 5' end of the DHFR cDNA. The 
EcoR l-Bglll fragment encompassing the coding sequences of the cDNA insert was then excised from 
pDHFR-D22 and ligated to EcoRI-BamHI digested pE342.HBV (Example 1), creating the DHFR expression 

25 plasmid pE342.D22. 

Example 3 

Vectors containing both DHFR and HBsAg sequences 

Two such vectors were constructed, pE342.HBV.D22 containing a polycistron wherein the DHFR gene 
30 is downstream from the HBsAg gene, and pE342.HBV.E400.D22, (Figure 3) in which the genes coding for 
DHFR and HBsAg are not polycistronic. 

A. pE342.HBV.D22 was constructed by ligating the EcoRI-Taql fragment of cloned HBV DNA (Liu et al. 
(supra)), to EcoRI-Clal digested pE342.D22. 

B. This plasmid was further modified by fusing an additional SV40 early promoter between the Bglll 
35 site and the Clal site of the DHFR insert of pE342.HBV.D22, creating pE342.HBV.E400.D22. 

HBV viral DNA contains a single Taql site 20 bp beyond the Bglll site that was used to generate the 
EcoRI-Bglll fragment encompassing the surface antigen gene. Thus, EcoRI and Taql digestion of cloned 
HBV viral DNA results in a fragment of -2000 bp spanning the surface antigen gene, and containing a 
single Bglll site (1985 bp from the EcoRI site (Liu etal. (supra)). (The ends of DNA fragments Taql and Clal 

40 generated by digestion are cohesive, and will ligate together). 

The Clal site is regenerated; thus pE342.HBV.D22 contains both a Bglll and Clal site, which are situated 
immediately in front of the DHFR coding sequences. 

An SV40 origin bounded by restriction sites cohesive with the Bglll and Clal sites of pE342.HBV.D22 
was constructed by digesting SV40 DNA with Hpall, filling in as described above, and suborning with 

45 Hind!!!. A 440 bp fragment spanning the origin was isolated. This was ligated, in a tripartite ligation, to the 
4000 bp pBR322 fragment generated by Hindlll and BamHI digestion, and the 1986 bp fragment spanning 
the surface antigen gene generated by digesting the cloned HBV virai DNA with EcoRI, filling in with 
Klenow DNA polymerase 1, subdigesting with Bglll, and isolating on an acrylamide gel. Ligation of all three 
fragments is achievable only by joining of the filled in Hpall with EcoRI, the two Hindlll sites with each other 

so and the Bglll with BamHI. Thus when the resulting plasmid is restricted with Clal and BamHI, a 470 bp 
fragment is obtained which contains the SV40 origin. This fragment is inserted into the Clal and Bglll sites 
of pE342.HBV.D22, (paragraph A) creating pE342.HBV.E400.D22 (Figure 3). 

Example 4 
55 Transfection of host cells 

The host cells herein are vertebrate cells grown in tissue culture. These ceils, as is known in the art, can 

be maintained as permanent cell lines prepared by successive serial transfers from isolated normal cells. 

These cell lines are maintained either on a solid support in liquid medium, or by growth in suspensions 

containing support nutrients. 
60 In the preferred embodiment, CHO cells, which were deficient in DHFR activity are used. These cells are 

prepared and propagated as described by Urlaub and Chasin, Proc. Natl. Acad. Sci (USA) 77:4216 (1980), 

which is incorporated herein by reference. 

The cells are transfected with 5 mg of desired vector as prepared above using the method of Graham 

and Van Der Eb, Virology 52:456 (1978) incorporated herein by reference. 
6S The method insures the interaction of a collection of plasmids with a particular host cell, thereby 
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increasing the probability that if one plasmid is absorbed by a cell, additional plasmids would be absorbed 
as well. Accordingly, it is practicable to introduce both the primary and secondary coding sequences using 
separate vectors for each, as well as by using a single vector containing both sequences. 

5 Example 5 

Growth of transferred cells and expression of peptides 

The CHO ceils which were subjected to transfection as set forth above were first grown for two days in 
non-selective medium, then the cells were transferred into medium lacking glycine, hypoxanthine, and 
thymidine, thus selecting for cells which are able to express the plasmid DHFR. After about 1 — 2 weeks, 

10 individual colonies were isolated with cloning rings. 

Ceils were plated in 60 or 100 mm tissue culture dishes at approximately .5x 10 6 cells/dish. After 2 days 
growth, growth medium was changed. HBsAg was assayed 24 hours later by RIA (Ausria II, Abbott). Cells 
were counted and HBsAg production standardized on a per ceil basis. 10 — 20 random colonies were 
analyzed in this fashion for each vector employed. 

15 In one example of the practice of the invention, the following results were obtained: 
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10 6 cells) 




HBsAg production; ng/10 6 cells/day 
(percent of colonies in given range) 




Vector 


0 


0—10 10—100 


100—500 


500—1500 


>1500 


pE342.D22 


935 


100 


0 0 


0 


0 


0 


PE342.HS94 


<.2 


1 


1 1 


1 


• 

1 


1 


pE342.D22+pE342.HS94 


340 


0 


50 30 


20 


0 


0 


pE342.HBV.D22 


20 


0 


0 0 


0 


55 


45 


pE342.HBV.E400.D22 


510 


0 


17 17 


58 


8 


0 



The production of surface antigen in several of the highest expressing cell lines has been monitored for 
greater than 20 passages and is stable. The cells expressing the surface antigen remain attached to the 
35 substratum indefinitely and will continue to secrete the large amounts of surface antigen as long as the 
medium is replenished. 

it is clear that the polycistronic gene construction results in isolation of the cells producing the highest 
levels of HBsAg. 100 percent of colonies transformed with pE342.HBV.D22 produced over 500 ng/10 6 
cells/day whereas 92 percent of those transformed with the non-polycistrontc plasmid 
40 pE342.HBV.E400.D22 produced less than that amount. Only cells from the polycistronic transfection 
demonstrated production levels of more than 1500 ng/10 6 cells/day. 

Example 6 

Treatment with methotrexate 

4S The surface antigen expressing cell lines are inhibited by methotrexate (MTX), a specific inhibitor of 
DHFR at concentrations greater than 10 nM. Consistent with previous studies on the effects of MTX on 
tissue culture cells, occasional clones arise which are resistant to higher concentrations (50 nM) of MTX at a 
frequency of approximately 10" 6 . However, these clones no longer produce surface antigen despite the 
amplification of HBV sequences in the MTX resistant clones. Thus, the HBV gene is amplified, though 

so expression falls off in this case. This suggests that further production of surface antigen may be lethal to 
the cell. 

Example 7 

Recovery of desired peptide 

55 The surface antigen produced is in the form of a particle, analogous to the 22 nm particle observed in 
the serum of patients infected with the virus. This form of antigen has been shown to be highly 
immunogenic. When the cells are grown in medium lacking calf serum or other supplements, 
approximately 10 percent of the protein contained in the medium is surface antigen and this protein can be 
isolated by methods known in the art The surface antigen comigrates on SDS-poiyacrylamide gels with the 

6 o 22 nm particle derived protein. 
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10 Claims 

1. A method of selecting transfected vertebrate host cells for expression of a desired polypeptide, 
which method comprises transfecting the vertebrate cells with an expression vector comprising a promoter 
operable in a vertebrate host cell and first and second polypeptide coding sequences under the control of 

rs said promoter, said coding sequences being separated by a translational stop signal and a translational 
start signal without any intervening splice site which is effective in the host cell and selecting transfectants 
which exhibit expression of the second polypeptide accompanied by a higher level of expression of said 
first polypeptide. 

2. A method according to Claim 1, wherein the second polypeptide is capable of marking transfection 
20 by the expression vector and/or regulating the expression of the first polypeptide. 

3. A method according to Claim 1 or Claim 2 wherein the promoter is the SV40 early promoter. 

4. A method according to any one of the preceding claims wherein the transfectants are grown under 
selective culture conditions favouring expression of the second polypeptide. 

5. A method according to any one of the preceding claims wherein the second polypeptide is DHFR. 
25 6. A method according to Claim 5 wherein the host cells are deficient in DHFR. 

7. A method according to Claim 5 or Claim 6 wherein the transfected host cell is grown in the presence 
of a DHFR inhibitor. 

8. A method according to Claim 7 wherein the DHFR inhibitor is methotrexate. 

9. A method according to any one of the preceding claims wherein the host cells are CHO cells. 
30 10. A method for producing a desired polypeptide which comprises culturing transfected vertebrate 

host cells obtained according to any one of the preceding claims so as to express said first polypeptide 
coding sequence, said first polypeptide being the desired polypeptide. 

Patentanspruche 

35 

1. Verfahren zur Selektion von transfizierten Wirbeltier-Wirtszellen fur die Expression eines 
gewunschten Polypeptides, welches Verfahren das Transfizieren der Wirbeltierzellen mit einem 
Expressionsvektor, der einen Promoter, der in einer Wirbeltierwirtszelle operabel ist, und erste und zweite 
Polypeptidkodierungssequenzen unter der Steuerung des genannten Promotors enthalt, wobei die 

40 genannten Kodierungssequenzen durch ein Translationsstopsignal und ein Translationsstartsignal ohne 
intervenierende SpleiBstelle getrennt sind, und der in der Wirtszelle wirkt, und die Selektion von 
Transfektanten, die die Expression des zweiten Polypeptides, begleitet von einem hpheren 
Expressionswert des genannten ersten Polypeptides zeigen, umfaSt. 

2. Verfahren nach Anspruch 1, worin das zweite Polypeptid fahig ist, die Transfektion durch den 
45 Expressionsvektor zu markieren und/oder die Expression des ersten Polypeptides zu steuern. 

3. Verfahren nach Anspruch 1 oder 2, worin der Promoter der fruhe SV40 Promoter ist 

4. Verfahren nach einem der vorhergehenden Anspruche, worin die Transfektanten unter selektiven 
Kulturbedingungen kultiviert werden, die die Expression des zweiten Polypeptides fordern. 

5. Verfahren nach einem der vorhergehenden AnsprQche, worin das zweite Polypeptid DHFR ist. 
50 6. Verfahren nach Anspruch 5, worin die Wirtzsellen arm an DHFR sind. 

7. Verfahren nach Anspruch 5 oder 6, worin die transfizierte Wirtszelle in Gegenwart eines 
DHFR-lnhibitors kultiviert wird. 

8. Verfahren nach Anspruch 7, worin der DHFR Inhibitor Methotrexat ist 

9. Verfahren nach einem der vorhergehenden Anspruche, worin die Wirtzellen CHO-Zellen sind. 
5 s 10. Verfahren zur Herstellung eines gewunschten Polypeptides, welches das Kultivieren von 

transfizierten Wirbeleiter-Wirtszeilen umfaBt, die nach einem der vorhergehenden Anspruche erhalten 
wurden, urn die genannte erste Polypeptid-Kodierungssequenz zu exprimieren, wobei das genannte erste 
Polypeptid das gewunschte Polypeptid ist 



60 Revendications 

1. Procede de selection de cellules notes transferees de vertebres pour I'expression d'un polypeptide 
souhaite, lequel procede comprend la transfection des cellules de vertebres avec un vecteur d'expression 
comprenant un promoteur utilisable dans une cellule hdte de vertebre et des premiere et seconde 
55 sequences de codage de polypeptide sous le contr6le dudit promoteur, lesdites sequences de codage etant 
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separees par un signal d'arret de traduction et un signal de debut de traduction sans aucin site 
intermediate d'dpissage qui est efficace dans la cellule hdte et la selection de transfectants qui presentent 
I'expression du second polypeptide accompagnee d'un niveau superieur d'expression dudit premier 
polypeptide. 

5 2. Procede selon la revendication 1 ou le second polypeptide est capable de marquer la transfection par 
le vecteur d'expression et/ou de reguler I'expression du premier polypeptide. 

3. ProcedS selon la revendication 1 ou la revendication 2 ou le promoteur est le promoteur precoce de 
SV40. 

4. Procede selon I'une quelconque des revendications precedentes. ou . les transfectants sont 
70 developpes en conditions de culture selective favorisant I'expression du second polypeptide. 

5. Procede selon Tune quelconque des revendications precedentes ou le second polypeptide est DHFR. 

6. Procede selon la revendication 5 ou les cellules hdtes sont deficientes en DHFR. 

7. Procede selon la revendication 5 ou la revendication 6 ou la cellule hdte transferee est developpee 
en presence d'un inhibiteur de DHFR. 

w 8. Procede selon la revendication 7 ou I'inhibiteur de DHFR est le methotrexate. 

9. Procede selon I'une quelconque des revendications precedentes ou les cellules hdtes sont des 
cellules de CHO. 

10. Procede de production d'un polypeptide souhaite, qui comprend la mise en culture de cellules 
hdtes transferees de vertebres obtenues selon I'une quelconque des revendications precedentes afin 

20 d'exprimer la sequence codant ledit premier polypeptide, ledit premier polypeptide etant le polypeptide 
souhaite. 
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